CompareDx: a Software Toolkit for Measuring the Performance of Programs that Generate Multiple Diagnoses
نویسندگان
چکیده
Introduction Evaluations of medical diagnosis programs have been carried out for several decades but for programs which produce multiple diagnoses there is a lack of suitable, well validated performance metrics. If a program reasons about only one (or a few) types of diagnosis, then the sensitivity and specificity of the program can readily be determined given a suitable standard diagnosis. However if the program is designed to reason about the possibility of dozens or hundreds of diagnoses other metrics may be required. Evaluating such programs usually requires a considerable amount of data per case and it is therefore difficult to collect more than 100 to 200 cases. This results in sparse data with many diagnoses appearing only once or twice in the evaluation (and many diagnoses not appearing at all). Calculating sensitivity and specificity for each diagnosis is therefore impractical, and only common diagnoses can be effectively evaluated. We have refined performance metrics for assessing diagnostic accuracy, evaluated them with data from a clinical evaluation study, and developed a Java program to implement the metrics efficiently.
منابع مشابه
New approaches to measuring the performance of programs that generate differential diagnoses using ROC curves and other metrics
INTRODUCTION Evaluation of computer programs which generate multiple diagnoses can be hampered by a lack of effective, well recognized performance metrics. We have developed a method to calculate mean sensitivity and specificity for multiple diagnoses and generate ROC curves. METHODS Data came from a clinical evaluation of the Heart Disease Program (HDP). Sensitivity, specificity, positive an...
متن کاملA Two-stage DEA Model Considering Shared Inputs, Free Intermediate Measures and Undesirable Outputs
Data envelopment analysis (DEA) has been proved to be an excellent approach for measuring the performance of decision-making units (DMUs) that use multiple inputs to generate multiple outputs. But the allocation problem of shared inputs and undesirable outputs does not arouse attention in this movement. This paper proposes a two-stage DEA model considering simultaneously the structure of shared...
متن کاملThe semi-automatic parallelisation of scientific application codes using a computer aided parallelisation toolkit
The shared-memory programming model is a very effective way to achieve parallelism on shared memory parallel computers. Historically, the lack of a programming standard for using directives and the rather limited performance due to scalability have affected the take-up of this programming model approach. Significant progress has been made in hardware and software technologies, as a result the p...
متن کاملMeasuring Iran\'s Industrial Clusters Competitiveness: Through Porterian Diamond Framework and Factors
Competitiveness is divided into two broad categories named as macroeconomic and microeconomic competitiveness. Microeconomic competitiveness indicators have a direct impact on company productivity within which sate of cluster development is the most important pillar of microeconomic competitiveness that are introduced by Michael Porter and it's framework as Diamond Model. The Diamond reveals al...
متن کاملMeasuring the Performance of the Virtual Teams in Global Software Development Projects
The development teams who are geographically spread, culturally mixed and mainly depend on information and communication technology (ICT) for communication is defined as a global virtual teams (GVTs). Despite the advancement of technologies, achieving the efficient performance of GVTs remains a challenge. The reviewed literature has highlighted the importance of training and development, organi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999